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ABSTRACT 



Aims. We analyze different data of the variation of the fine structure constant obtained with different methods to check their 
consistency. 

Methods. We test consistency using the modified student test and confidence intervals. We split the data sets into smaller 
intervals. A criterion for this selection is proposed. 

Results. Results show consistency for reduced intervals for each pair of data sets considered. 

Conclusions. Results are at variance with the ones obtained considering mean values over the whole interval. 
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1. Introduction 

The attempt to unify all fundamental interactions resulted 
in the development of multi dimensional theor i es like 
string- motivated field theories rtWu fc Wandll986l; iMaed. 



198 



Barr fc Mohapatrdll988l iDamour fc Polvakovll99 
relat ed b r ane-world theorie s 
20031: iBrax et alj l20Q3h 



Damour et alJ l2002allj) 
(|Youml l206Talrbl : IPalma et al . 

and ( related or n o t) Kaluza - Klein theories dKaluza 



1921 

Il9851 
ries. 



Gleiser fc Taylor 



Klem] Il926t IWeinberd fl98 

lOverduin fc Wesson! 1 19971 ) . Among these theo- 
there are some in which the gauge coupling con- 
stants may vary over cosmological timescales. On the 
other hand, theoretical frameworks based o n first prin- 
ciples , were developed by di f ferent authors dBekenstein 



19821 l2002t IChamoun et al 
Olive fc Pospelovl 12002^ 



2001; 



Barrow et al. |_|2002l 
Barrow fc Magueiiol |2005( ) to 



study the variation of the fine structure constant (a) or 
the electron mass (m e ). 

Different versions of the theories mentioned above pre- 
dict different time behaviors of the fundamental constants. 
Thus, bounds obtained from astronomical and geophysi- 
cal data are an important tool to test the validity of these 
theories. 

The experimental research can be grouped into as- 
tronomical and local methods. The latter ones include 
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geophysical methods such as the natural nuclear reac- 
tor that operated about 1.8 1 9 years ago in Oklo, 
Gabon (jPamour fc Dvsonl [1991 iFuiii et all l2000h . the 
analysis of natural long- lived (3 decayers in geological 
minerals and meteorites (jOlive et al.l [2004) and labora- 
tory measurements such as comparisons o f rates between 



clock s with differe nt at omic numbers (Presta ge et al 



19951: ISortais et all l2000t iMarion etaD 120031: iBize et al 



20031 : iFischer et al.ll2004 iPeik et al.ll2004r i. The astronom- 
ical methods are based mainly on the analysis of high- 
redshift quasar absorption systems. The relative magni- 
tude of the fine splitting of resonance lin e s of alk aline 
ions is proportion al to a 2 . Murphy et al. ( 2001bl ) and 
Chand et all (|2005h have applied this method to SilV dou- 



blet absorption lines systems at different redshifts. An 
extens ion of this method was proposed by iBahcall et al 



(2004). These authors use strong nebular emission lines 
of O III to constrain the variation of a. Furthermore, 
this method was improved by comparing transitions of 
different species with widely far atomic masses and led 
to the first results consistent with a time varying fine 
struc ture constant for a range of redshifts (0.5 < z < 
3.5) (jWebb et al.l I1999L l200lUMurphv et aDl2003l) . This 
method is known in the literature as the many multiplct 
method (MM). Howe ver, other recent independent anal- 
yses of similar data (jSrianand et al. 2004 ; Chand et al 



2004 120061 iQuast et al.ll2004 found no variation. On the 



other hand, the standard MM technique can be revised 
to avoid the deficiencies pointed out earlier in the liter- 
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ature (jBahcall et all 12004 iQuast et all 120041 ) . This im- 
proved method called in the literature as r evised many 
multip let (RMM) method, was applied by IQuast et al. 
( 20041 ) to a homogeneous sample of Fell lines at redshift 
z = 1.15. Another method, to test cosmological varia- 
tion of a, from pairs of Fe II lines observed in individual 
exposure s from a high-re s olutio n spectrograph was pro- 
posed bv lLevshakov et all ( 20051 ) (this method is known in 
the literature as SID AM). The au thors found no variatio n 
of a at z = 1.84 and z = 1.15 (jLevshakov et al.ll2006t) . 
However, a recent reanalysis of spectru m of the quasar 
Q110 1-264 found variability within lcr ( Levshakov et al.l 
20071 ). Besides, by comparing optical and radio redshifts, a 
bound on a 2 g p ^ (where g v is the proton g factor) can be 
obtained ( Cowie fe Songailal 19951 : Tzanavaris et al 1l2007t ). 
Furthermore, comparing molecular and radio lines pro- 
vides a bound on Q pO 2 and the m o st strin gent constraints 



were obtained by iMurphv et al.l (|2001af ). On the other 



hand, I Par rind (|2004j ) reports bounds on the variation of 
a at z = 0.2 467 from the satellite 18 cm OH conjugate 
lines. Finally. [Kanekar et al. I (|2005h compared the HI and 
OH main line absorption redshifts of the different compo- 
nents in the z = 0.765 absorber and the z = 0.685 lens 
toward B0218- 

changes in F 



-357 to establish stringent constraints on 

1.57 

. Besides, the time varia- 



9p 



( a 2 m e \ 



tion of the gauge coupling constants in the early universe 
can be constrained usi ng data from the cosmic microwave 



backg round (CMB) (jMartins et all 120021: iRocha et al" 



2003; ) and the primordial abundances of light e lements 
(jlchikawa fe Kawasakill2002l ; iNollet fe Lopedl2"o"02h . 

In this paper, we would like to focus on the discrep- 
ancy between the data on time variation of a from as- 
tronomical observations, following a purely statistical cri- 
terion. As described above, different methods are able to 
constrain the variation of a. However, to achieve statis- 
tical consistency for each fixed redshift interval, the re- 
ported value should be the same. The usual assumption 
is that if there is any time variation of a, such variation 
is constant over all observed redshifts. So, the reported 
results are means over a range of redshifts. However, the- 
ories mentioned above predict different time evolu tions of 
a yielding different variations for diffe rent times (jOkada 
19851 : lMarcia^olll984 lBekensteinlll982l ) . Therefore, to test 
the evolution of a predicted by those kind of theories, 
not only a mean value over a range of redshifts, but each 
individual measurement should be considered. However, 
not all the reported results are consistent. The most im- 
portant di screpancy in the liter ature are the results re- 
ported by Murphy et al. ( 20031 ) and those reported by 
Chand et al .1 (|2004f) . It should be also mentioned that 



there is an imp ortant debate in the literatur e about the re- 



liabil i ty of the IChand et al] (2004) results dMurphv et al 
20071 : ISrianand et al.ll2007l : fMurphv et!d~ll2008l ). The aim 



of this paper is to test the consistency of different reported 
values for the variation of a for smaller intervals than the 
reported ones, in each case. In this way, we will be able to 
define a reduced interval where consistency can be assured 



and thus the theoretical prediction for the evolution of a 
with time within this interval can be tested. The selection 
of the redshift interval to be tested is not trivial and a 
method for this selection is also proposed and described 
in Sects. [2] and [3] It is also important to note that obser- 
vational errors are important and can not be ignored in 
any statistical analysis. In Sect. [2] we describe the statis- 
tical tools (which include the observational errors) we use 
for testing consistency. In Sect. [31 we apply these tools to 
check consistency between the data on varying a. Not all 
groups of data can be tested using the modified student 
test because the requirements of the method for selecting 
the redshift intervals can not be fulfilled by most groups 
of data. Therefore, we compute confidence intervals of a 
group of data and compare them with a single reported 
value of another author to check consistency in most of 
the cases. We find that consistency between pairs of data 
sets can be assured for a reduced interval in each case. In 
Sect. |4j we discuss our results and write our conclusions. 

2. Statistical tools 

The question to be answered is whether, for given redshift 
intervals, two experiments compared are consistent or not. 
The appropriate procedure is thus a test for the difference 
between two population means, which involves a statis- 
tic defined in terms of two sample means and two sample 
variances. However, in some cases, one of the experiments 
includes, for a given redshift interval, very little data, and 
therefore does not allow us to reasonably define a sample 
mean and a sample variance. In this situation, the pro- 
cedure to be followed should rather involve a confidence 
interval constructed from the sample values of the experi- 
ment allowing for a statistical treatment. In what follows, 
we discuss in detail these two approaches as well as the 
choice of the sample size (which, consequently, determines 
the width of the redshift intervals). 

2.1. Student test 

Within a statistical framework, the null hypothesis (two 
experiments are consistent) can be formulated as 



H : - fi 2 = 



(1) 



where /ii, /12 are the (unknown) population means of each 
experiment for a definite redshift interval. Because the 
sample sizes for the available observational data are not 
expected to be large, and the true variances are not known, 
to test the hypothesis we must use a t test, which involves 
the sample variances. Now, the usual t test is not robust 
to departures from normality or from equality of variances 
when the number of values within each sample are not 
equal. Thus, we adopt an approximate test starting from 
the statistic (DevoreJ|l995): 



T 



X\ — X-2 





st 
V — 


V m 


n 



(2) 
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where X\, X2 are the sample mean values for the given 
redshift interval, m, n are the data numbers of each sam- 
ple and Si,S% are, respectively, their sample v ariances 



corre cted to include the observational errors e% ((Brandt 
Il989h : 

(3) 



s- 



E 



Pi [<$ + (xi-X) 1 



(4) 



The rejection region (RR) for the two-tailed test is defined 

by 

(T<-U v 
RR.l 2 ' (5) 

where the number of degrees of freedom v is given by the 
rounded value of 



S\ S\ 
— + — 

m n 



{Sl/mf , (Sl/nf 



(G) 



1 



1 



and A is the (approximate) level of the test ( Brownleel 
196fj ). Thus, A is the approximate probability of Type I 
error, that is, the probability of rejecting the null hypoth- 
esis when it is true. In practice, the algorithm used will 
yield a level value A* such that the obtained value of the 
statistic lies within the associated rejection region (Eq. ([5]) 
with A*). Therefore, at level A the null hypothesis should 
be rejected when A* < A. 

2.2. Confidence Intervals 

In some cases, one of the experiments to be tested includes 
very few data and, therefore, does not allow us to define 
a sample mean and a sample variance for a given redshift 
interval. In other cases, the amount of data is statistically 
very low to consider the results of the modified student 
test reliable. For those cases, we introduce a different pro- 
cedure: Suppose that for a given redshift interval a group 
of data 1 allows for a statistical treatment, while a group 
of data 2 does not. To test the consistency of given obser- 
vation 2 against observation 1 , from the values of group 1 
we construct an interval I of confidence 100 P%. Then, if 
the null hypothesis is true, P = 1 — A is the probability 
that the result of an observation of group 2 lies within this 
confidence interval, and the null hypothesis should be re- 
jected at level A when this is not the case. The confidence 
interval is then centered at the mean value X of sample 
1, and its width is determined by the complement of the 
rejection region of a two-tailed test. Thus, under the same 
hypothesis of the preceding subsection, we have 



I : 



X — t\ „_i— —;X + t\_ 1— = 



where n is the number of values of sample 1, and S 2 is the 
sample variance (corrected as above to include the obser- 
vational errors); as before, the choice of the t distribution 
is motivated by the size of the samples, which are not ex- 
pected to be large. In practice, we choose a level A and the 
algorithm yields a confidence interval for this level. Then, 
we compare the confidence interval obtained from group 
1 with each single reported value for the variation of a 
obtained from group 2. 

2.3. Sample size 

From a statistical point of view, a possible criterion to 
estimate the appropriate sample sizes is to limit the prob- 
ability of Type II error, that is, the probability j3 of not 
rejecting the null hypothesis when it is false. However, 
while the probability A of Type I error can be fixed in- 
dependent of the population or sample values, the cal- 
culation of (3 requires the choice of a definite alternative 
hypothesis; that is, to determine j3 the inequality /ii ^ fi2 
must be specialized as a definite equality /ii — fi2 = S. 
Then, once we have precisely determined a definite alter- 
native hypothesis and chosen a level j3, in some cases we 
can obtain a simple analytical expression of the required 
sample size n. For a t distribution no simple expression 
exists, which can easily be understood by recalling that 
this distribution includes as a parameter the number of 
degrees of freedom. However if one is only interested in an 
estimate, an approximate analytical expression is given in 
terms of a normal distribution; for a two-tailed test for 
the mean of a population (o r for the assoc iated confidence 
interval) we have (see Ref. ( Devorel 1995t )): 

2 



, S 

ZA + Z/3 ) - 



(8) 



where za, z^ are obtained by inverting a normal N(0,1) 
distribution (this would slightly underestimate the size n, 
because the t distribution is less peaked than the normal 
distribution, but this is not relevant if one is not interested 
in an exact result). In the case that we test the difference 
between the means of two populations, if the correspond- 
ing two samples are of equal (or at least similar) sizes, 
formula © can also be applied, with S = y/ S% + <Sf • I n 
this approach, the choice of the approximate sample size 
is thus determined by the Type II error that one is to ad- 
mit for a given departure from the null hypothesis, this 
departure being measured by comparison with the sample 
variance. In practice, for a proposed level /3 we take as 
a reasonable assumption an alternative hypothesis 5 ~ S 
and obtain the corresponding sample size n; this proves 
to be consistent with usual choices of what is to be con- 
sidered as a significant non null result (see the Discussion 
below) . 

3. Results 



Table [T] summarizes the bounds for direct measurements 
CO of — . Table [2] shows the details of data that constrain 
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a combination of fundamental constants that include a. 
Unifying schemes predict that the variation of fundamen- 
tal constants is related, and that their relationship de- 
pends on the theoretical framework. However, we limit 
ourselves to studying the variation of a and do not con- 
sider the possible variation of other fundamental con- 
stants. We have e xcluded the data fro m qu asar emission 
lines r eported bv iBahcall et al.1 ( 2004 ) and iGrupe et al.l 
( 20051) because the individual errors of those data are 2 
orders of magnitude above errors of other groups of data . 
On the other hand, the d ata reported by iDarlina (|2004l ) 
and Murphy et al. ( 2001al ) at z = 0.24 could not be tested 
either because there are not enough data from other au- 
thors at similar redshift to build confidence intervals. 

One of the major problems of this analysis lies in the 
selection of the redshift interval to be tested. The natu- 
ral choice is a length equal to the observational error of 
the measured redshifts. However, at present there are not 
enough data available to use this criterion. Another pos- 
sibility would be a bin size coming from theories that pre- 
dict time variation of fundamental constants. However, in 
general, those models have free parameters, which are esti- 
mated from observational bounds on varying a. Therefore, 
we decided to use a purely statistical criterion. As ex- 
plained in Sect. 12.31 the amount of data can be determined 
from limits on Type II error. Now, we fix A = 0.025, and 
Eq. E gives n ~ 12, 15, 18 for = 0.1, 0.05, 0.025 respec- 
tively. 

3.1. Student Test 



The selection of redshift intervals to be tested would pro- 
ceed as follows: The first interval (to be considered to 
apply the test) starts at a redshift z — a with b width, 
where b is the minimum length that includes n data (n 
will depend on the desired value of /?, see above) for both 
groups of data or for only one group of data in the case 
of confidence intervals. The following i intervals will start 
at redshift z = a + i * 0.1 and again the value of b is cho- 
sen with the same criterion. For analyzing the results we 
also define a criterion: i) If all values of A* are below the 
desired level for all intervals, we conclude that there is no 
consistency between both groups of data; ii) if all values 
of A* are above the desired level, we conclude that there is 
consistency between both groups of data and; iii) in case 
that some values of A* are below and some values of A* are 
above the desired level, we conclude consistency for a re- 
duced interval. We exclude from the consistency interval, 
all intervals for which A* < A. We can assure consistency 
for the remaining interval. 

We ap plied the modified stude nt test to the data re - 
ported bv lchand etal] ([20051 ) and lMurphv et all (|2001al ) 
for n = 12 (a higher n is not possible since the total data 
from (jChand et al.ll2005l ) are 15). Results showing consis- 
tency for the whole interval are shown in Fig. [TJ It should 
be noted that the modified student test can be applied 
only for redshift intervals larger than 0.76 due to the re- 



quirement that n = 12. Other groups of data could not 
be tested in this way due to two kinds of problems: i) the 
total number of data is lower than 12; ii) The number of 
data of one author for a fixed interval is much larger than 
the number of data of other authors for the same interval. 
This does not fulfill one of the assumptions to obtain Eq. 

m 

3.2. Confidence Intervals 

Since the requirements to apply the modified student test 
are not always fulfilled by the available data, the calcula- 
tion of confidence intervals is a useful tool to test consis- 
tency among data on varying a. We choose A = 0.025 and 
build a confidence interval for a group of data and compare 
the results with each single reported value of another au- 
thor. Again, we choose the bin size as the smallest interval 
centered at the reported value, which contains n data (n 
depends on the desired (3 level) . We also define a criterion 
for analyzing the results. If all confidence intervals overlap 
with the reported value of the other author, we conclude 
that there is consistency for the whole interval, while if 
none of them overlap, we conclude that there is no con- 
sistency. In case there are some confidence intervals that 
do not overlap with the respective reported interval, we 
exclude from the consistency interval the corresponding 
redshift interval. This is a conservative criterion, because 
we are probably overestimating the discarded intervals. 
The available data allow us to perform the analysis for 
n = 18 (P — 0.025), and we show these results because 
the discarded intervals are larger than in the cases for 
higher values of (3. 

Table [3] shows the confidenc e intervals calculated for 
the data of lMurphv et al. ( 20031) for reds h ift int ervals cen- 
tered in each value of the IChand et al. ( 2004 ) data and 
containing 18 data points (we have also calculated the con- 
fidence intervals for other values of n and results are shown 
in Sect. [4]). From Table [3] it follows that the confidence in- 
tervals calculated for z = 1.277, z = 1.541, and z — 1.637 
do not overlap with the corresponding reported intervals. 
Therefore, the redshift interval (1.14, 1.93) should be dis- 
carded from the cons istency interval because the data from 
Murphy et all (|2003h used to calculate the confidence in- 
tervals belong to this interval. Therefore, from the confi- 
dence intervals analysis we conclude that there is consis- 
tency over the interv als (0.452,1.1 4 ) and (1.93,2.3), i.e., 
11 data points from IChand et all d2004h are consistent 
with 70 data points from lMurphv et all (|20o3) (only 103 
of the 128 data po ints are eligible to be tested against 
Chand et al.l (|2004t ) data using the criterion n = 18). 

From Table p i ] it fo llows that the data reported by 



Levshakov et al 



( 20071 ) are at variance wi th the confi- 
dence interval calculated with the data of Chand et al.l 
(|2004l ) at z = 1.84. From Table we obtain that t he con - 
sist ency interval for the d ata sets o f lMurohv et ail (|2001bl ) 
and lMurphv et all (|2003l) is (2.01, 2.09)(2.46, 2.51). From 



Table [6l it follows that the consistency interval for the 
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Table 1. Groups of data considered for methods that constrain — 



Method 
(1) 


Redshift or Redshift interval 
(2) 


Number of data 
(3) 


^ ±a 

a 

(4) 


Reference 
(5) 


Many Multiplet 


0.22 < z < 3.6 


128 


-0.543 ±0.116 


(1) 


Many Multiplet 


0.452 < z < 2.3 


23 


-0.06 ±0.06 


(2) 


Many Multiplet 


z = 1.15 


1 


-0.05 ±0.24 


(3) 


RMM 


z = 1.15 


1 


-0.04 ±0.46 


(4) 


Alkali Doublet 


2 < z < 3 


19 


-0.50 ±1.30 


(5) 


Alkali Doublet 


1.9 <z < 2.8 


15 


0.15 ±0.43 


(6) 


SIDAM 


z = 1.15 


1 


-0.007 ±0.084 


(7) 


SIDAM 


z = 1.84 


1 


0.54 ±0.25 


(8) 



Notes.- Columns: (1) observational method; (2) redshift or redshift interval; (3) number of data; 
(4) mean value of — and the corresponding la error in units of 10~ 5 ; (5) reference. 
References: (1) Murphy et al. (2003) : (2) Chand et al. (2004) : (3) Chand et al. (2006) 
(4) Quast et al. (2004) ; (5) Murphy et al. (2001b); (6) Chand et al. (2005) 
(7) Levshakov et al. (2006) : (8) Levshakov et al. (2007) 



Table 2. Groups of data considered for methods that constrain quantities related to 



Method 


Redshift or Redshift interval 


Quantity Measured 


^±a 

a 


Reference 


(1) 


(2) 


(3) 


(4) 


(5) 


Optical and radio lines 


0.23 < z < 2.4 


o?g P — 


0.32 ± 0.50 


(1) 


Molecular and radio lines 


z = 0.69 


2 

a g p 


-0.08 ±0.27 


(2) 



OH conjugate lines and radio lines 
OH conjugate lines and radio lines 



z = 0.765 
z = 0.685 



/ m e \ 

V m p ) 

/ 2 \ 1 

/ a m e \ 

\ m p ) 



0.27 ±0.27 
0.11 ± 0.13 



(3) 
(3) 



Notes.- The observational methods consist in comparing the redshift obtained with different spectral lines. 
Columns: (1) observational method specifying which lines are compared; (2) redshift or redshift interval; 
(3) quantity measured; (4) mean value of — and its corresponding la error; (5) reference. 
References: (1) Tzanavaris et al. (2007) : (2) Murphy et al. (2001a) : (3) Kanekar et al. (2005) 



data sets of IChand et ail (|2005l ) and lMurphv et all (|2003h 
is (1.33, 1.71)( 2.1, 3.11). The c onsiste ncy in terval for the 



data sets of iMurphv et al. ( 2001b ) and Chand et al 



(l2005h is (1.33, 1.72)(2.09,3. 02) (see Table E). Finally , the 
consistency in t ervals for the iTzanavaris et alJ (|20071 ) and 
I Murphy et al.l ( 20031 ) data sets calculated from Tableware 
(2.27,2.51). These results are at variance with the result 
obtained comparing mean values for the whole interval. 

4. Discussion and Conclusion 



In Sect. [3] we have obtained two consistency redshift in- 
tervals ((0. 45,1.14) and (2 03,2 .3)), when compar i ng; th e 
data sets of lMurphv et al.l (|2003h and lChand et al.l (|2004l ). 
Now, we apply the modified student test defined in Sect. 
12.11 to the two consistency intervals. We obtain A* = 0.28 
and A* = 0.42 respectively, which confirms the consis- 



tency of these two redshift intervals as analyzed using 
confidence intervals. In this case, we do not have to worry 
about the number of samples being equal, since the se- 
lection of the redshift interval is already done. It is also 
interesting to repeat the calculation of confidence inter- 
vals for different values of (3 (or n). Table [5] shows that 
for a higher value of /3, the consistency interval is larger 
than the interval obtained for a lower value of (3. We also 
performed the modified student test for the consistency 
intervals of table [9] and obtain A* > 0.25. We have also 
calculated the consistency intervals for the other groups 
of data for larger values of /?. We find that there is con- 
sistency between the confidence interval and the reported 
interval for (3 — 0.05 and (3 = 0.1 in all cases but the 
comparison of the data set from lMurphv et a l. (2003) and 
Tzanavaris et al For this case, the redshift con- 



sistency interval for (3 = 0.05 is (0.238, 1.78)(2. 17, 2.47) 
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Fig. 1. Results of modified student test comparing results from lChand et al.l (|2005l ) with iMurphv et al.l (|2001bf ) 
Table 4. Confidence intervals for different rcdshifts. 



z 


Az 


Reported Interval 


Reference 


Confidence Interval 


Reference 


(1) 


(2) 


(3) 


(4) 


(5) 


(6) 


z = 1.15 


0.24 


(-0.091,0.077) 


(1) 


(-1.4,-0.06) 


(7) 


z = 1.15 


1.5 


(-0.091,0.077) 


(1) 


(-0.13,0.13) 


(8) 


z = 1.15 


0.24 


(-0.5,0.42) 


(2) 


(-1.4, -0.06) 


(7) 


z = 1.15 


1.4 


(-0.5,0.42) 


(2) 


(-0.13,0.13) 


(8) 


z = 1.15 


0.24 


(-0.19,0.29) 


(3) 


(-1.4,-0.06) 


(7) 


z = 1.15 


1.5 


(-0.19,0.29) 


(3) 


(-0.13,0.13) 


(8) 


z = 1.84 


0.55 


(0.29,0.79) 


(4) 


(-1.21,0.56) 


(7) 


z = 1.84 


1.8 


(0.29,0.79) 


(4) 


(-0.13,0.15) 


(8) 


z = 0.69 


0.32 


(-0.19,0.35) 


(5) 


(-0.87,0.53) 


(7) 


z = 0.765 


0.26 


(0,0.54) 


(6) 


(-0.79,0.51) 


(7) 


2 = 0.685 


0.33 


(-0.02,0.24) 


(6) 


(-0.99,0.49) 


(7) 



Notes.- Columns: (1) redshift; (2) redshift interval for which the confidence interval is calculated; 

(3) single value reported by one author in units of 10 ; (4) reference of column (3); 

(5) calculated confidence interval from a group of data of another author in units of 10 -5 ; 

(6) reference of column (5). 

References: (1) Levshakov et al. (2006) : (2) Quasi et al. (2004) : (3) Chand et al. (2006) 

(4 ) Levshakov et al. (2007); (5) Murphy et al. (2001a); (6) Kanekar et al. (2005) 

(7) Murphy et al. (2003) : (8) Chand et al. (2004) 



and for = 0.1 we obtain (0.238, 1.79)(2.14, 2.47). On the 
other hand, we would like to comm ent o n the compar- 
ison between the Chand et al. (2005) and IMurphv et al.l 
( 2001bl l data sets. In this case, both the modified student 
test and the calculation of confidence intervals were used 
to analyze consistency with different results. However, it 
should be stressed that the modified student test is able 
to test redshift intervals larger than 0.76, while the length 
of the confidence intervals lies between 0.26 and 0.53. 

A statistical criterion has been given to estimate the 
appropriate sample sizes (which determine the width of 



the redshift intervals) . The criterion is based on the prob- 
ability of not rejecting the null hypothesis (consistency 
of two experiments compared) when a definite alternative 
hypothesis is supposed to be true. While being well justi- 
fied on statistical grounds, this proposal involves a subtle 
point: The choice of a given alternative hypothesis is not 
provided by a theoretical fr amework. Rather, ou r choice 
is suggested by the results of Murphy et alj ( 20031 ). which 
motivated our analysis. In that work, the non null result 
for the variation of a was supported by a departure from 
the null hypothesis 6 ~ 4.5 a — 4.5S/y/n (S the sample 
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Table 3. Confidence intervals for different redshifts. Table 5. Confidence intervals for different redshifts . 



z 


Az 


Reported Interval 
(3) 


Confidence Interval 










(1) 


(2) 


(4) 


z 


Az 


Reported Interval 


Confidence Interval 


0.452 


0.78 


(-0.30,0.70) 


(-0.96,0.50) 


(1) 


(2) 


(3) 


(4) 


0.822 


0.21 


(-0.90,0.90) 


(-0.78,0.24) 


2.01 


0.41 


(-1.04,0.28) 


(-1.72,0.68) 


0.859 


0.23 


(-0.50, -0.10) 


(-0.80,0.34) 


2.06 


0.44 


(-2.13,1.23) 


(-1.57,0.81) 


0.873 


0.23 


(-0.20,0.20) 


(-0.62,0.43) 


2.12 


0.40 


(-1.09,0.39) 


(-1.43,0.88) 


0.908 


0.20 


(-0.80,0) 


(-0.74,0.25) 


2.14 


0.37 


(-2.56,-0.02) 


(-1.39,0.82) 


0.942 


0.18 


(-1.90, -0.50) 


(-0.88,0.19) 


2.20 


0.40 


(-1.26,0.08) 


(-1.33,0.91) 


1.182 


0.27 


(-0.80,0.80) 


(-1.51,-0.20) 


2.28 


0.37 


(0.44, 1.64) 


(-1.56,0.39) 


1.243 


0.22 


(-0.20, 0) 


(-1.18,-0.13) 


2.30 


0.39 


(-1.03,0.43) 


(-1.64,0.49) 


1.277 


0.27 


(0.,0.20) 


(-1.16,-0.20) 


2.31 


0.41 


(-0.14,1.94) 


(-1.64,0.50) 


1.348 


0.35 


(-1.00, -0.20) 


(-1.17,-0.31) 


2.46 


0.33 


(-0.81, -0.09) 


(-1.52,0.36) 


1.439 


0.42 


(-0.50,0.50) 


(-1.14, -0.41) 


2.53 


0.44 


(-0.27,1.03) 


(-1.44,0.38) 


1.541 


0.51 


(-0.20,0.20) 


(-1.08,-0.40) 


2.62 


0.38 


(-0.90, -0.30) 


(-1.55,0.23) 


1.555 


0.48 


(-0.30, 0.70) 


(-0.97,-0.21) 


2.64 


0.40 


(-1.04,1.08) 


(-1.14,0.52) 


1.636 


0.59 


(-0.50,0.90) 


(-1.21,-0.25) 


2.77 


0.51 


(0.70,2.10) 


(-1.63,0.61) 


1.637 


0.59 


(0..1.20) 


(-1.21,-0.25) 


2.81 


0.54 


(-0.44, 1.04) 


(-1.63,0.61) 


1.657 


0.60 


(-0.20,0.80) 


(-1.16,-0.14) 


2.85 


0.60 


(-0.14,0.66) 


(-1.58,0.67) 


1.858 


0.49 


(0.,0.80) 


(-1.38,0.47) 


2.90 


0.69 


(-2.18,0.) 


(-1.48,0.69) 


1.915 


0.38 


(0.50, 1.10) 


(-1.33,0.76) 


2.91 


0.71 


(-0.67,1.33) 


(-1.48,0.69) 


2.022 


0.44 


(-0.50,0.30) 


(-1.72,0.69) 


2.98 


0.82 


(-1.35,2.19) 


(-1.97,0.59) 


2.168 


0.41 


(-0.40, 0.40) 


(-1.41,0.78) 


3.02 


0.84 


(-1.08,0.28) 


(-1.97,0.59) 


2.185 


0.38 


(-0.10,0.50) 


(-1.33,0.91) 


Notes: Column: (1) redshift; (2) length of the redshift interval 


2.187 


0.38 


(-0.40,0.) 


(-1.33,0.91) 


for which the confidence interval is 


calculated; (3) single value 


2.300 


0.389 


(-0.80,0) 


(-1.64,0.50) 


reported bv MurDhv et al. (2001b") 


in units of 10 -5 ; (4) calculated 









Notes.- Columns: (1) redshift; (2) length of the redshift 
interval for which the confidence interval is calculated; 
(3) single value reported by Cha nd et al. (2004) 
in units of 10~ 5 ; (4) calculated confidence interval 
from a group of data reported by Murphy et al. (2003) 
in units of 10 -5 . 



variance); our choice for the alternative hypothesis 5 ~ S 
is consistent with this for the employed sample sizes. 

We thus have analyzed the consistency between data 
sets on varying a, using the statistical tools described in 
Sect. [2] Usually, results are taken to be means over a range 
of redshifts. Here, instead, we have analyzed smaller in- 
tervals. The criterion for the selection of intervals is based 
on limits on the probability of type II error (see section 
12. 3[) . We h ave obtained two con siste ncy intervals for th e 
data set of IMurphv et alj (|2003t) and I Chand et all (|2004) , 
while the mean values and errors of the whole interval 
sh ow no consistency . We have also shown that the data 

(|2003f) are 



confidence interval from a group of data reported by 
Murphy et al. (20031 in units of 10" 5 . 



which should be used when testing theories on varying a. 
For the other data sets, the consistency depends on the 
desired value of j3. The result for (3 = 0.025 is also at vari- 
ance with the one obtained calculating the mean values 
and errors over the whole interval. 
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of iTzanavaris et al.l (|2007l ) and IMurphv et al. 
not consistent when tested over small redshift intervals. 
We have also obtained consistency intervals for this case, 
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Table 6. Confidence intervals for different redshifts. Table 7. Confidence intervals for different redshifts. 



z 


Az 


Reported Interval 


Confidence Interval 


(1) 


(2) 


(3) 


(4) 


1.597 


0.53 


(-2.92,1.72) 


(-1.20,-0.32) 


1.908 


0.39 


(0.92, 15.08) 


(-1.33,0.76) 


1.915 


0.38 


(-18.34,6.74) 


(-1.33,0.76) 


1.970 


0.35 


(-6.74, 24.34) 


(1.61,0.61) 


1.973 


0.36 


(-2.64,2.04) 


(1.61,0.61) 


1.976 


0.36 


(-10.60, 10.00) 


(-1.45,1.05) 


2.168 


0.41 


(-2.39,3.39) 


(-1.41,0.78) 


2.185 


0.38 


(-2.60,1.00) 


(-1.33,0.91) 


2.329 


0.44 


(-13.44,2.24) 


(-1.92,0.27) 


2.451 


0.35 


(-0.92,2.92) 


(-1.46,0.36) 


2.455 


0.34 


(-6.07,2.07) 


(-1.52,0.36) 


2.456 


0.34 


(-4.83,5.23) 


(-1.52,0.36) 


2.464 


0.32 


(-1.08,3.08) 


(-1.52,0.36) 


2.493 


0.36 


(-1.09,10.49) 


(-1.44,0.38) 


2.828 


0.58 


(-5.76,5.16) 


(-1.63,0.61) 



Notes.- Columns: (1) redshift; (2) lenght of the redshift 
interval for which the confidence interval is calculated; 
(3) single value reported by Chand ct al. (2005) 



in units of 10 -5 ; (4) calculated confidence interval 
from a group of data reported by Mur phy et al. (2003) 
in units of 10 . 



z 


Az 


Reported Interval 


Confidence Interval 


(1) 


(2) 


(3) 


(4) 


1.597 


0.53 


(-2.92,1.72) 


(-1.20,-0.32) 


1.908 


0.37 


(0.92, 15.08) 


(-1.17,0.41) 


1.915 


0.36 


(-18.34,6.74) 


(-1.17,0.41) 


1.970 


0.34 


(-6.74, 24.34) 


(-1.31,0.29) 


1.973 


0.33 


(-2.64,2.04) 


(-1.27,0.35) 


1.976 


0.33 


(-10.60, 10.00) 


(-1.27,0.35) 


2.168 


0.26 


(-2.39,3.39) 


(-0.74,0.47) 


2.185 


0.25 


(-2.60,1.00) 


(-0.83,0.47) 


2.329 


0.32 


(-13.44,2.24) 


(-0.72,0.34) 


2.451 


0.28 


(-0.92,2.92) 


(-0.85,0.38) 


2.455 


0.29 


(-6.07, 2.07) 


(-0.86,0.38) 


2.456 


0.29 


(-4.83,5.23) 


(-0.79,0.36) 


2.464 


0.31 


(-1.08,3.08) 


(-0.79,0.36) 


2.493 


0.26 


(-1.09,10.49) 


(-0.87, -0.04) 


2.828 


0.38 


(-5.76,5.16) 


(-0.50,0.74) 



Notes.- Columns: (1) redshift; (2) length of the redshift 
interval for which the confidence interval is calculated; 
(3) single value reported by Chand ct al. (2005) 



in units of 10 ; (4) calculated confidence interval 
from a group of data reported by ^lurphy ct al. (2001b) 
in units of 10 . 



Table 8. Confidence intervals for different redshifts. 
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Table 9. Consistency intervals obtained for different val- 
ues of j3 (A = 0.025 for all cases). 



p 


n 


Consistency Intervals 


ni 


n 2 


0.1 


12 
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12 


69 


0.05 


15 
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12 


67 


0.025 


18 


(0.45, 1.14)(1.92, 2.30) 


11 


67 



Notes.- m is the number of data reported by 



Cha nd et al. (2004) that are consistent with ri2 data 
from Murphy et al. (20031 . 
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